Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 757349 |
| Missing cells | 1416017 |
| Missing cells (%) | 9.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 115.6 MiB |
| Average record size in memory | 160.0 B |
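The derived figures in this table follow from the raw counts. A minimal sketch of that arithmetic, assuming the missing share is missing cells over rows × columns and that 1 MiB is 2^20 bytes (both are assumptions about how the report computes them, not something the report states):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 20
n_observations = 757_349
missing_cells = 1_416_017
total_size_mib = 115.6

# Assumption: missing share = missing cells / (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 MiB = 2**20 bytes; the reported size is itself rounded,
# so the result only approximates the reported 160.0 B.
avg_record_bytes = total_size_mib * 2**20 / n_observations

print(f"Total cells: {total_cells}")
print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.2f} B")
```

The missing share reproduces the reported 9.3%; the record size lands within rounding distance of the reported value because the 115.6 MiB input is already rounded.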
Variable types
| Categorical | 10 |
|---|---|
| Numeric | 7 |
| Unsupported | 2 |
| Boolean | 1 |
State has constant value "Adamawa" | Constant |
Regimen has a high cardinality: 109 distinct values | High cardinality |
PHARMACY_ID is highly correlated with PATIENT_ID and 2 other fields | High correlation |
PATIENT_ID is highly correlated with PHARMACY_ID and 1 other fields | High correlation |
FACILITY_ID is highly correlated with PHARMACY_ID and 1 other fields | High correlation |
ADHERENCE is highly correlated with PHARMACY_ID | High correlation |
PATIENT_ID is highly correlated with FACILITY_ID | High correlation |
FACILITY_ID is highly correlated with PATIENT_ID | High correlation |
ADR_IDS is highly correlated with PHARMACY_ID and 5 other fields | High correlation |
PHARMACY_ID is highly correlated with ADR_IDS and 6 other fields | High correlation |
Regimen Line is highly correlated with ADR_IDS | High correlation |
AFTERNOON is highly correlated with EVENING | High correlation |
DMOC_TYPE is highly correlated with PHARMACY_ID and 3 other fields | High correlation |
L.G.A is highly correlated with ADR_IDS and 6 other fields | High correlation |
ADHERENCE is highly correlated with ADR_IDS and 5 other fields | High correlation |
PATIENT_ID is highly correlated with ADR_IDS and 5 other fields | High correlation |
EVENING is highly correlated with AFTERNOON | High correlation |
Facility Name is highly correlated with ADR_IDS and 6 other fields | High correlation |
FACILITY_ID is highly correlated with PHARMACY_ID and 5 other fields | High correlation |
ADR_SCREENED has 79955 (10.6%) missing values | Missing |
ADR_IDS has 757310 (> 99.9%) missing values | Missing |
DMOC_TYPE has 578741 (76.4%) missing values | Missing |
MORNING is highly skewed (γ1 = 160.802169) | Skewed |
BODY_WEIGHT is highly skewed (γ1 = 28.28863313) | Skewed |
PHARMACY_ID has unique values | Unique |
DATE_VISIT is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
NEXT_APPOINTMENT is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
MORNING has 301222 (39.8%) zeros | Zeros |
EVENING has 201465 (26.6%) zeros | Zeros |
BODY_WEIGHT has 755118 (99.7%) zeros | Zeros |
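The percentages in the Missing and Zeros alerts can be re-derived from the counts and the 757349 observations. The sketch below does so; the `format_share` helper is hypothetical, and its display rule (bounds instead of values below 0.1% or above 99.9%) is an assumption inferred from figures such as "> 99.9%" for ADR_IDS:

```python
def format_share(count: int, total: int) -> str:
    """Format count/total the way the alert percentages appear to be shown."""
    pct = 100 * count / total
    # Assumption: non-zero shares below 0.1% or above 99.9% are shown
    # as bounds rather than rounded values.
    if 0 < pct < 0.1:
        return "< 0.1%"
    if 99.9 < pct < 100:
        return "> 99.9%"
    return f"{pct:.1f}%"


N_ROWS = 757_349  # number of observations

# (column, kind, count) triples copied from the alerts.
alerts = [
    ("ADR_SCREENED", "missing values", 79_955),
    ("ADR_IDS", "missing values", 757_310),
    ("DMOC_TYPE", "missing values", 578_741),
    ("MORNING", "zeros", 301_222),
    ("EVENING", "zeros", 201_465),
    ("BODY_WEIGHT", "zeros", 755_118),
]

for column, kind, count in alerts:
    print(f"{column} has {count} ({format_share(count, N_ROWS)}) {kind}")
```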
Reproduction
| Analysis started | 2021-06-15 09:14:13.063728 |
|---|---|
| Analysis finished | 2021-06-15 09:15:41.151713 |
| Duration | 1 minute and 28.09 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
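A report like this one can be regenerated with pandas-profiling. The following is a minimal sketch under assumptions: the input file name is a placeholder, the data is assumed to load with `pandas.read_csv`, and default settings are used because the contents of `config.json` are not shown here.

```python
def build_profile(csv_path: str, output_html: str = "report.html") -> None:
    """Profile a CSV file and write the HTML report (sketch, default config)."""
    # Imported lazily so the function can be defined without the
    # third-party packages installed.
    import pandas as pd
    from pandas_profiling import ProfileReport  # package name as of v3.0.0

    df = pd.read_csv(csv_path)
    report = ProfileReport(df, title="Dataset profile")
    report.to_file(output_html)


# Hypothetical usage; the path is a placeholder, not the real source file:
# build_profile("pharmacy_visits.csv", "report.html")
```

Run time will depend on the machine; the report above took about a minute and a half for 757349 rows.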
State
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.8 MiB |
| Adamawa |
|---|
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 5301443 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Adamawa |
|---|---|
| 2nd row | Adamawa |
| 3rd row | Adamawa |
| 4th row | Adamawa |
| 5th row | Adamawa |
Common Values
| Value | Count | Frequency (%) |
| Adamawa | 757349 | 100.0% |
[Figures: length histogram, pie chart]
| Value | Count | Frequency (%) |
| adamawa | 757349 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2272047 | 42.9% |
| A | 757349 | 14.3% |
| d | 757349 | 14.3% |
| m | 757349 | 14.3% |
| w | 757349 | 14.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4544094 | 85.7% |
| Uppercase Letter | 757349 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2272047 | 50.0% |
| d | 757349 | 16.7% |
| m | 757349 | 16.7% |
| w | 757349 | 16.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 757349 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5301443 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2272047 | 42.9% |
| A | 757349 | 14.3% |
| d | 757349 | 14.3% |
| m | 757349 | 14.3% |
| w | 757349 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5301443 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2272047 | 42.9% |
| A | 757349 | 14.3% |
| d | 757349 | 14.3% |
| m | 757349 | 14.3% |
| w | 757349 | 14.3% |
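The character and category counts for this constant column can be reproduced from the single value "Adamawa" and the row count. A small sketch using the standard library; it assumes the report's categories correspond to Unicode general categories, where `Lu` is Uppercase Letter and `Ll` is Lowercase Letter:

```python
import unicodedata
from collections import Counter

value = "Adamawa"
n_rows = 757_349  # every row holds the same value

# Per-character counts for one value, scaled by the number of rows.
char_counts = Counter({ch: n * n_rows for ch, n in Counter(value).items()})

# Aggregate the same counts by Unicode general category.
category_counts = Counter()
for ch, n in char_counts.items():
    category_counts[unicodedata.category(ch)] += n

total = sum(char_counts.values())
print("Total characters:", total)
for ch, n in char_counts.most_common():
    print(ch, n, f"{100 * n / total:.1f}%")
for cat, n in category_counts.most_common():
    print(cat, n, f"{100 * n / total:.1f}%")
```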
L.G.A
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.8 MiB |
| Mubi South | |
|---|---|
| Song | |
| Numan | |
| Michika | |
| Hong | |
| Other values (5) |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 7.093749381 |
| Min length | 4 |
Characters and Unicode
| Total characters | 5372444 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Girei |
|---|---|
| 2nd row | Girei |
| 3rd row | Girei |
| 4th row | Girei |
| 5th row | Girei |
Common Values
| Value | Count | Frequency (%) |
| Mubi South | 316231 | 41.8% |
| Song | 120950 | 16.0% |
| Numan | 116869 | 15.4% |
| Michika | 91173 | 12.0% |
| Hong | 57359 | 7.6% |
| Gayuk | 38373 | 5.1% |
| Girei | 7252 | 1.0% |
| Maiha | 5737 | 0.8% |
| Demsa | 3236 | 0.4% |
| Madagali | 169 | < 0.1% |
[Figures: length histogram, pie chart]
| Value | Count | Frequency (%) |
| mubi | 316231 | 29.5% |
| south | 316231 | 29.5% |
| song | 120950 | 11.3% |
| numan | 116869 | 10.9% |
| michika | 91173 | 8.5% |
| hong | 57359 | 5.3% |
| gayuk | 38373 | 3.6% |
| girei | 7252 | 0.7% |
| maiha | 5737 | 0.5% |
| demsa | 3236 | 0.3% |
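The length, character, and word figures for this column all follow from the value counts in the Common Values table. The sketch below recomputes them, assuming words are lowercased, whitespace-separated tokens (an assumption that matches the counts shown, not a documented rule):

```python
from collections import Counter

# Value counts copied from the "Common Values" table.
value_counts = {
    "Mubi South": 316_231, "Song": 120_950, "Numan": 116_869,
    "Michika": 91_173, "Hong": 57_359, "Gayuk": 38_373,
    "Girei": 7_252, "Maiha": 5_737, "Demsa": 3_236, "Madagali": 169,
}

word_counts = Counter()
total_chars = 0
for value, count in value_counts.items():
    total_chars += len(value) * count
    # Assumption: words are lowercased, whitespace-separated tokens.
    for word in value.lower().split():
        word_counts[word] += count

n_rows = sum(value_counts.values())
n_words = sum(word_counts.values())
mean_length = total_chars / n_rows

print("Rows:", n_rows)
print("Total characters:", total_chars)
print("Mean length:", mean_length)
for word, count in word_counts.most_common(3):
    print(word, count, f"{100 * count / n_words:.1f}%")
```

Because "Mubi South" contributes two words per row, the word total exceeds the row count, which is why "song" is 16.0% of rows but 11.3% of words.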
Most occurring characters
| Value | Count | Frequency (%) |
| u | 787704 | 14.7% |
| i | 518987 | 9.7% |
| o | 494540 | 9.2% |
| S | 437181 | 8.1% |
| M | 413310 | 7.7% |
| h | 413141 | 7.7% |
| b | 316231 | 5.9% |
| (space) | 316231 | 5.9% |
| t | 316231 | 5.9% |
| n | 295178 | 5.5% |
| Other values (15) | 1063710 | 19.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3982633 | 74.1% |
| Uppercase Letter | 1073580 | 20.0% |
| Space Separator | 316231 | 5.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 787704 | 19.8% |
| i | 518987 | 13.0% |
| o | 494540 | 12.4% |
| h | 413141 | 10.4% |
| b | 316231 | 7.9% |
| t | 316231 | 7.9% |
| n | 295178 | 7.4% |
| a | 261632 | 6.6% |
| g | 178478 | 4.5% |
| k | 129546 | 3.3% |
| Other values (8) | 270965 | 6.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 437181 | 40.7% |
| M | 413310 | 38.5% |
| N | 116869 | 10.9% |
| H | 57359 | 5.3% |
| G | 45625 | 4.2% |
| D | 3236 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 316231 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5056213 | 94.1% |
| Common | 316231 | 5.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 787704 | 15.6% |
| i | 518987 | 10.3% |
| o | 494540 | 9.8% |
| S | 437181 | 8.6% |
| M | 413310 | 8.2% |
| h | 413141 | 8.2% |
| b | 316231 | 6.3% |
| t | 316231 | 6.3% |
| n | 295178 | 5.8% |
| a | 261632 | 5.2% |
| Other values (14) | 802078 | 15.9% |
Common
| Value | Count | Frequency (%) |
| (space) | 316231 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5372444 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| u | 787704 | 14.7% |
| i | 518987 | 9.7% |
| o | 494540 | 9.2% |
| S | 437181 | 8.1% |
| M | 413310 | 7.7% |
| h | 413141 | 7.7% |
| b | 316231 | 5.9% |
| (space) | 316231 | 5.9% |
| t | 316231 | 5.9% |
| n | 295178 | 5.5% |
| Other values (15) | 1063710 | 19.8% |
Facility Name
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.8 MiB |
| Mubi General Hospital | |
|---|---|
| Song Cottage Hospital | |
| Numan General Hospital | |
| Michika General Hospital | |
| Hong Cottage Hospital | |
| Other values (5) |
Length
| Max length | 24 |
|---|---|
| Median length | 21 |
| Mean length | 21.51972208 |
| Min length | 14 |
Characters and Unicode
| Total characters | 16297940 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Girei B Clinic |
|---|---|
| 2nd row | Girei B Clinic |
| 3rd row | Girei B Clinic |
| 4th row | Girei B Clinic |
| 5th row | Girei B Clinic |
Common Values
| Value | Count | Frequency (%) |
| Mubi General Hospital | 316231 | 41.8% |
| Song Cottage Hospital | 120950 | 16.0% |
| Numan General Hospital | 116869 | 15.4% |
| Michika General Hospital | 91173 | 12.0% |
| Hong Cottage Hospital | 57359 | 7.6% |
| Guyuk General Hospital | 38373 | 5.1% |
| Girei B Clinic | 7252 | 1.0% |
| Maiha Cottage Hospital | 5737 | 0.8% |
| Borrong General Hospital | 3236 | 0.4% |
| Cottage Hospital Gulak | 169 | < 0.1% |
[Figures: length histogram, pie chart]
| Value | Count | Frequency (%) |
| hospital | 750097 | 33.0% |
| general | 565882 | 24.9% |
| mubi | 316231 | 13.9% |
| cottage | 184215 | 8.1% |
| song | 120950 | 5.3% |
| numan | 116869 | 5.1% |
| michika | 91173 | 4.0% |
| hong | 57359 | 2.5% |
| guyuk | 38373 | 1.7% |
| b | 7252 | 0.3% |
| Other values (5) | 23646 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1719879 | 10.6% |
| (space) | 1514698 | 9.3% |
| l | 1323400 | 8.1% |
| e | 1323231 | 8.1% |
| i | 1283419 | 7.9% |
| o | 1119093 | 6.9% |
| t | 1118527 | 6.9% |
| n | 871548 | 5.3% |
| H | 807456 | 5.0% |
| s | 750097 | 4.6% |
| Other values (16) | 4466592 | 27.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12511195 | 76.8% |
| Uppercase Letter | 2272047 | 13.9% |
| Space Separator | 1514698 | 9.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1719879 | 13.7% |
| l | 1323400 | 10.6% |
| e | 1323231 | 10.6% |
| i | 1283419 | 10.3% |
| o | 1119093 | 8.9% |
| t | 1118527 | 8.9% |
| n | 871548 | 7.0% |
| s | 750097 | 6.0% |
| p | 750097 | 6.0% |
| r | 579606 | 4.6% |
| Other values (8) | 1672298 | 13.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 807456 | 35.5% |
| G | 611676 | 26.9% |
| M | 413141 | 18.2% |
| C | 191467 | 8.4% |
| S | 120950 | 5.3% |
| N | 116869 | 5.1% |
| B | 10488 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1514698 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14783242 | 90.7% |
| Common | 1514698 | 9.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1719879 | 11.6% |
| l | 1323400 | 9.0% |
| e | 1323231 | 9.0% |
| i | 1283419 | 8.7% |
| o | 1119093 | 7.6% |
| t | 1118527 | 7.6% |
| n | 871548 | 5.9% |
| H | 807456 | 5.5% |
| s | 750097 | 5.1% |
| p | 750097 | 5.1% |
| Other values (15) | 3716495 | 25.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 1514698 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16297940 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1719879 | 10.6% |
| (space) | 1514698 | 9.3% |
| l | 1323400 | 8.1% |
| e | 1323231 | 8.1% |
| i | 1283419 | 7.9% |
| o | 1119093 | 6.9% |
| t | 1118527 | 6.9% |
| n | 871548 | 5.3% |
| H | 807456 | 5.0% |
| s | 750097 | 4.6% |
| Other values (16) | 4466592 | 27.4% |
Regimen Line
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.8 MiB |
| ART First Line Adult | |
|---|---|
| Cotrimoxazole (CTX) Prophylaxis | |
| Isoniazid Preventive Therapy (IPT) | 26129 |
| ART First Line Children | 21648 |
| ART Second Line Adult | 5215 |
| Other values (9) | 984 |
Length
| Max length | 46 |
|---|---|
| Median length | 20 |
| Mean length | 21.84627167 |
| Min length | 10 |
Characters and Unicode
| Total characters | 16545252 |
|---|---|
| Distinct characters | 40 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ART First Line Adult |
|---|---|
| 2nd row | Isoniazid Preventive Therapy (IPT) |
| 3rd row | ART First Line Adult |
| 4th row | ART First Line Adult |
| 5th row | ART First Line Adult |
Common Values
| Value | Count | Frequency (%) |
| ART First Line Adult | 616043 | 81.3% |
| Cotrimoxazole (CTX) Prophylaxis | 87330 | 11.5% |
| Isoniazid Preventive Therapy (IPT) | 26129 | 3.5% |
| ART First Line Children | 21648 | 2.9% |
| ART Second Line Adult | 5215 | 0.7% |
| OI Treatment | 341 | < 0.1% |
| ART Second Line Children | 222 | < 0.1% |
| ARV Prophylaxis for Pregnant Women | 117 | < 0.1% |
| Other Medicines | 114 | < 0.1% |
| Other anti-infectives (including STI Medicine) | 91 | < 0.1% |
| Other values (4) | 99 | < 0.1% |
[Figure: length histogram]
| Value | Count | Frequency (%) |
| line | 643130 | 21.9% |
| art | 643128 | 21.9% |
| first | 637691 | 21.7% |
| adult | 621316 | 21.1% |
| prophylaxis | 87477 | 3.0% |
| cotrimoxazole | 87330 | 3.0% |
| ctx | 87330 | 3.0% |
| preventive | 26129 | 0.9% |
| isoniazid | 26129 | 0.9% |
| therapy | 26129 | 0.9% |
| Other values (18) | 55504 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 2183944 | 13.2% |
| i | 1556761 | 9.4% |
| t | 1373816 | 8.3% |
| A | 1264591 | 7.6% |
| r | 887514 | 5.4% |
| e | 864139 | 5.2% |
| l | 818093 | 4.9% |
| T | 783284 | 4.7% |
| s | 751532 | 4.5% |
| n | 724092 | 4.4% |
| Other values (30) | 5337486 | 32.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9679095 | 58.5% |
| Uppercase Letter | 4455022 | 26.9% |
| Space Separator | 2183944 | 13.2% |
| Open Punctuation | 113550 | 0.7% |
| Close Punctuation | 113550 | 0.7% |
| Dash Punctuation | 91 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1556761 | 16.1% |
| t | 1373816 | 14.2% |
| r | 887514 | 9.2% |
| e | 864139 | 8.9% |
| l | 818093 | 8.5% |
| s | 751532 | 7.8% |
| n | 724092 | 7.5% |
| d | 675059 | 7.0% |
| u | 621407 | 6.4% |
| o | 381297 | 3.9% |
| Other values (11) | 1025385 | 10.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1264591 | 28.4% |
| T | 783284 | 17.6% |
| R | 643275 | 14.4% |
| L | 643130 | 14.4% |
| F | 637691 | 14.3% |
| C | 196539 | 4.4% |
| P | 139852 | 3.1% |
| X | 87330 | 2.0% |
| I | 52720 | 1.2% |
| S | 5528 | 0.1% |
| Other values (5) | 1082 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2183944 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 113550 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 113550 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 91 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14134117 | 85.4% |
| Common | 2411135 | 14.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1556761 | 11.0% |
| t | 1373816 | 9.7% |
| A | 1264591 | 8.9% |
| r | 887514 | 6.3% |
| e | 864139 | 6.1% |
| l | 818093 | 5.8% |
| T | 783284 | 5.5% |
| s | 751532 | 5.3% |
| n | 724092 | 5.1% |
| d | 675059 | 4.8% |
| Other values (26) | 4435236 | 31.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 2183944 | 90.6% |
| ( | 113550 | 4.7% |
| ) | 113550 | 4.7% |
| - | 91 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16545252 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 2183944 | 13.2% |
| i | 1556761 | 9.4% |
| t | 1373816 | 8.3% |
| A | 1264591 | 7.6% |
| r | 887514 | 5.4% |
| e | 864139 | 5.2% |
| l | 818093 | 4.9% |
| T | 783284 | 4.7% |
| s | 751532 | 4.5% |
| n | 724092 | 4.4% |
| Other values (30) | 5337486 | 32.3% |
Regimen
Categorical
| Distinct | 109 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.8 MiB |
| TDF(300mg)+3TC(300mg)+DTG(50mg) | |
|---|---|
| TDF(300mg)+3TC(300mg)+EFV(600mg) | |
| AZT(300mg)+3TC(150mg)+NVP(200mg) | |
| Cotrimoxazole 960mg | |
| Isoniazid 300mg | |
| Other values (104) |
Length
| Max length | 62 |
|---|---|
| Median length | 32 |
| Mean length | 29.6154547 |
| Min length | 10 |
Characters and Unicode
| Total characters | 22429235 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 8 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | TDF(300mg)+3TC(300mg)+DTG(50mg) |
|---|---|
| 2nd row | Isoniazid 300mg |
| 3rd row | TDF(300mg)+3TC(300mg)+DTG(50mg) |
| 4th row | TDF(300mg)+3TC(300mg)+EFV(600mg) |
| 5th row | TDF(300mg)+3TC(300mg)+EFV(600mg) |
Common Values
| Value | Count | Frequency (%) |
| TDF(300mg)+3TC(300mg)+DTG(50mg) | 231946 | 30.6% |
| TDF(300mg)+3TC(300mg)+EFV(600mg) | 227793 | 30.1% |
| AZT(300mg)+3TC(150mg)+NVP(200mg) | 128173 | 16.9% |
| Cotrimoxazole 960mg | 85378 | 11.3% |
| Isoniazid 300mg | 25465 | 3.4% |
| AZT(300mg)+3TC(150mg)+EFV(600mg) | 9576 | 1.3% |
| AZT(10mg/ml)+3TC(10mg/ml)+NVP(10mg/ml) | 7018 | 0.9% |
| TDF(300mg)+3TC(300mg)+NVP(200mg) | 5002 | 0.7% |
| ABC(60mg)+3TC(30mg)+LPV/r(40/10mg) | 3699 | 0.5% |
| TDF(300mg)+3TC(30mg)+DTG(50mg) | 3497 | 0.5% |
| Other values (99) | 29802 | 3.9% |
[Figure: length histogram]
| Value | Count | Frequency (%) |
| tdf(300mg)+3tc(300mg)+dtg(50mg | 231946 | 26.6% |
| tdf(300mg)+3tc(300mg)+efv(600mg | 227793 | 26.1% |
| azt(300mg)+3tc(150mg)+nvp(200mg | 128173 | 14.7% |
| cotrimoxazole | 87803 | 10.1% |
| 960mg | 85378 | 9.8% |
| isoniazid | 26082 | 3.0% |
| 300mg | 25466 | 2.9% |
| azt(300mg)+3tc(150mg)+efv(600mg | 9576 | 1.1% |
| azt(10mg/ml)+3tc(10mg/ml)+nvp(10mg/ml | 7018 | 0.8% |
| tdf(300mg)+3tc(300mg)+nvp(200mg | 5002 | 0.6% |
| Other values (122) | 37520 | 4.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3565814 | 15.9% |
| m | 2139609 | 9.5% |
| g | 2026240 | 9.0% |
| ( | 1910887 | 8.5% |
| ) | 1910887 | 8.5% |
| 3 | 1771226 | 7.9% |
| T | 1518806 | 6.8% |
| + | 1268000 | 5.7% |
| C | 739698 | 3.3% |
| F | 725335 | 3.2% |
| Other values (47) | 4852733 | 21.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6499858 | 29.0% |
| Lowercase Letter | 5384861 | 24.0% |
| Uppercase Letter | 5259123 | 23.4% |
| Open Punctuation | 1910887 | 8.5% |
| Close Punctuation | 1910887 | 8.5% |
| Math Symbol | 1268000 | 5.7% |
| Space Separator | 114408 | 0.5% |
| Other Punctuation | 81211 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 2139609 | 39.7% |
| g | 2026240 | 37.6% |
| o | 290407 | 5.4% |
| i | 140738 | 2.6% |
| a | 114622 | 2.1% |
| z | 114352 | 2.1% |
| l | 113913 | 2.1% |
| r | 97627 | 1.8% |
| e | 88490 | 1.6% |
| t | 88243 | 1.6% |
| Other values (12) | 170620 | 3.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1518806 | 28.9% |
| C | 739698 | 14.1% |
| F | 725335 | 13.8% |
| D | 714998 | 13.6% |
| V | 403884 | 7.7% |
| E | 243162 | 4.6% |
| G | 238131 | 4.5% |
| A | 166635 | 3.2% |
| P | 158743 | 3.0% |
| Z | 156095 | 3.0% |
| Other values (9) | 193636 | 3.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3565814 | 54.9% |
| 3 | 1771226 | 27.3% |
| 5 | 396126 | 6.1% |
| 6 | 336577 | 5.2% |
| 1 | 180403 | 2.8% |
| 2 | 153374 | 2.4% |
| 9 | 85378 | 1.3% |
| 4 | 8959 | 0.1% |
| 8 | 1726 | < 0.1% |
| 7 | 275 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 81209 | > 99.9% |
| , | 2 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1910887 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1910887 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1268000 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 114408 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11785251 | 52.5% |
| Latin | 10643984 | 47.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| m | 2139609 | 20.1% |
| g | 2026240 | 19.0% |
| T | 1518806 | 14.3% |
| C | 739698 | 6.9% |
| F | 725335 | 6.8% |
| D | 714998 | 6.7% |
| V | 403884 | 3.8% |
| o | 290407 | 2.7% |
| E | 243162 | 2.3% |
| G | 238131 | 2.2% |
| Other values (31) | 1603714 | 15.1% |
Common
| Value | Count | Frequency (%) |
| 0 | 3565814 | 30.3% |
| ( | 1910887 | 16.2% |
| ) | 1910887 | 16.2% |
| 3 | 1771226 | 15.0% |
| + | 1268000 | 10.8% |
| 5 | 396126 | 3.4% |
| 6 | 336577 | 2.9% |
| 1 | 180403 | 1.5% |
| 2 | 153374 | 1.3% |
| (space) | 114408 | 1.0% |
| Other values (6) | 177549 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22429235 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3565814 | 15.9% |
| m | 2139609 | 9.5% |
| g | 2026240 | 9.0% |
| ( | 1910887 | 8.5% |
| ) | 1910887 | 8.5% |
| 3 | 1771226 | 7.9% |
| T | 1518806 | 6.8% |
| + | 1268000 | 5.7% |
| C | 739698 | 3.3% |
| F | 725335 | 3.2% |
| Other values (47) | 4852733 | 21.6% |
PHARMACY_ID
Real number (ℝ≥0)
| Distinct | 757349 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1910151.303 |
| Minimum | 209355 |
| Maximum | 4080353 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.8 MiB |
Quantile statistics
| Minimum | 209355 |
|---|---|
| 5-th percentile | 398867.4 |
| Q1 | 768582 |
| median | 1750427 |
| Q3 | 3166281 |
| 95-th percentile | 3804175.6 |
| Maximum | 4080353 |
| Range | 3870998 |
| Interquartile range (IQR) | 2397699 |
Descriptive statistics
| Standard deviation | 1236196.862 |
|---|---|
| Coefficient of variation (CV) | 0.6471722212 |
| Kurtosis | -1.332274199 |
| Mean | 1910151.303 |
| Median Absolute Deviation (MAD) | 1109174 |
| Skewness | 0.4037218083 |
| Sum | 1.446651179 × 10^12 |
| Variance | 1.528182681 × 10^12 |
| Monotonicity | Strictly increasing |
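Several of the statistics above are simple functions of the others: range is maximum minus minimum, IQR is Q3 minus Q1, the coefficient of variation is standard deviation over mean, variance is the squared standard deviation, and the sum is the mean times the row count. A short consistency check using only the reported figures (a sketch; the tolerance is an arbitrary choice to absorb rounding in the printed values):

```python
import math

# Figures copied from the quantile and descriptive statistics tables.
minimum, maximum = 209_355, 4_080_353
q1, q3 = 768_582, 3_166_281
mean = 1_910_151.303
std = 1_236_196.862
n_rows = 757_349

reported = {
    "range": 3_870_998,
    "iqr": 2_397_699,
    "cv": 0.6471722212,
    "variance": 1.528182681e12,
    "sum": 1.446651179e12,
}

derived = {
    "range": maximum - minimum,
    "iqr": q3 - q1,
    "cv": std / mean,        # coefficient of variation
    "variance": std ** 2,
    "sum": mean * n_rows,
}

for name, value in derived.items():
    # Inputs are printed to about 10 significant digits, so compare loosely.
    ok = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name}: derived={value:.10g} reported={reported[name]:.10g} match={ok}")
```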
| Value | Count | Frequency (%) |
| 3147777 | 1 | < 0.1% |
| 3709322 | 1 | < 0.1% |
| 3224435 | 1 | < 0.1% |
| 1121142 | 1 | < 0.1% |
| 1619509 | 1 | < 0.1% |
| 3712098 | 1 | < 0.1% |
| 3728417 | 1 | < 0.1% |
| 3748115 | 1 | < 0.1% |
| 3098498 | 1 | < 0.1% |
| 1137534 | 1 | < 0.1% |
| Other values (757339) | 757339 | > 99.9% |
| Value | Count | Frequency (%) |
| 209355 | 1 | < 0.1% |
| 209362 | 1 | < 0.1% |
| 209368 | 1 | < 0.1% |
| 209375 | 1 | < 0.1% |
| 209382 | 1 | < 0.1% |
| 209389 | 1 | < 0.1% |
| 209396 | 1 | < 0.1% |
| 209402 | 1 | < 0.1% |
| 209407 | 1 | < 0.1% |
| 209415 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 4080353 | 1 | < 0.1% |
| 4080352 | 1 | < 0.1% |
| 4080351 | 1 | < 0.1% |
| 4080350 | 1 | < 0.1% |
| 4080349 | 1 | < 0.1% |
| 4080347 | 1 | < 0.1% |
| 4080346 | 1 | < 0.1% |
| 4080345 | 1 | < 0.1% |
| 4080344 | 1 | < 0.1% |
| 4080343 | 1 | < 0.1% |
PATIENT_ID
Real number (ℝ≥0)
| Distinct | 23093 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 69294.81317 |
| Minimum | 37869 |
| Maximum | 160842 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.8 MiB |
Quantile statistics
| Minimum | 37869 |
|---|---|
| 5-th percentile | 38903.4 |
| Q1 | 43760 |
| median | 49270 |
| Q3 | 55171 |
| 95-th percentile | 153578 |
| Maximum | 160842 |
| Range | 122973 |
| Interquartile range (IQR) | 11411 |
Descriptive statistics
| Standard deviation | 42509.01086 |
|---|---|
| Coefficient of variation (CV) | 0.6134515545 |
| Kurtosis | -0.1400648017 |
| Mean | 69294.81317 |
| Median Absolute Deviation (MAD) | 5699 |
| Skewness | 1.322614282 |
| Sum | 5.248035746 × 10^10 |
| Variance | 1807016004 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 150928 | 352 | < 0.1% |
| 49325 | 234 | < 0.1% |
| 154039 | 221 | < 0.1% |
| 49434 | 211 | < 0.1% |
| 150993 | 198 | < 0.1% |
| 153889 | 196 | < 0.1% |
| 151667 | 190 | < 0.1% |
| 44837 | 185 | < 0.1% |
| 49602 | 185 | < 0.1% |
| 46359 | 184 | < 0.1% |
| Other values (23083) | 755193 | 99.7% |
| Value | Count | Frequency (%) |
| 37869 | 29 | < 0.1% |
| 37870 | 62 | < 0.1% |
| 37871 | 21 | < 0.1% |
| 37872 | 22 | < 0.1% |
| 37873 | 17 | < 0.1% |
| 37874 | 8 | < 0.1% |
| 37875 | 13 | < 0.1% |
| 37876 | 14 | < 0.1% |
| 37877 | 15 | < 0.1% |
| 37878 | 13 | < 0.1% |
| Value | Count | Frequency (%) |
| 160842 | 5 | < 0.1% |
| 160841 | 5 | < 0.1% |
| 160840 | 5 | < 0.1% |
| 160738 | 4 | < 0.1% |
| 160737 | 3 | < 0.1% |
| 160732 | 4 | < 0.1% |
| 160731 | 4 | < 0.1% |
| 160728 | 4 | < 0.1% |
| 160671 | 4 | < 0.1% |
| 160670 | 4 | < 0.1% |
FACILITY_ID
Real number (ℝ≥0)
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 840.2045517 |
| Minimum | 421 |
| Maximum | 2887 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.8 MiB |
Quantile statistics
| Minimum | 421 |
|---|---|
| 5-th percentile | 425 |
| Q1 | 433 |
| median | 434 |
| Q3 | 436 |
| 95-th percentile | 2881 |
| Maximum | 2887 |
| Range | 2466 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 911.882779 |
|---|---|
| Coefficient of variation (CV) | 1.085310449 |
| Kurtosis | 1.209708576 |
| Mean | 840.2045517 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.79153985 |
| Sum | 636328077 |
| Variance | 831530.2026 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 434 | 316231 | 41.8% |
| 436 | 120950 | 16.0% |
| 2881 | 116869 | 15.4% |
| 433 | 91173 | 12.0% |
| 426 | 57359 | 7.6% |
| 425 | 38373 | 5.1% |
| 421 | 7252 | 1.0% |
| 2884 | 5737 | 0.8% |
| 2887 | 3236 | 0.4% |
| 2886 | 169 | < 0.1% |
| Value | Count | Frequency (%) |
| 421 | 7252 | 1.0% |
| 425 | 38373 | 5.1% |
| 426 | 57359 | 7.6% |
| 433 | 91173 | 12.0% |
| 434 | 316231 | 41.8% |
| 436 | 120950 | 16.0% |
| 2881 | 116869 | 15.4% |
| 2884 | 5737 | 0.8% |
| 2886 | 169 | < 0.1% |
| 2887 | 3236 | 0.4% |
| Value | Count | Frequency (%) |
| 2887 | 3236 | 0.4% |
| 2886 | 169 | < 0.1% |
| 2884 | 5737 | 0.8% |
| 2881 | 116869 | 15.4% |
| 436 | 120950 | 16.0% |
| 434 | 316231 | 41.8% |
| 433 | 91173 | 12.0% |
| 426 | 57359 | 7.6% |
| 425 | 38373 | 5.1% |
| 421 | 7252 | 1.0% |
DURATION
Real number (ℝ≥0)
| Distinct | 157 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 73.24852083 |
| Minimum | 0 |
| Maximum | 9168 |
| Zeros | 2509 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 24 |
| Q1 | 60 |
| median | 60 |
| Q3 | 90 |
| 95-th percentile | 180 |
| Maximum | 9168 |
| Range | 9168 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 45.36618966 |
|---|---|
| Coefficient of variation (CV) | 0.619346154 |
| Kurtosis | 2544.775317 |
| Mean | 73.24852083 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 15.52463523 |
| Sum | 55474694 |
| Variance | 2058.091164 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 60 | 381801 | 50.4% |
| 90 | 115506 | 15.3% |
| 30 | 97128 | 12.8% |
| 180 | 73124 | 9.7% |
| 120 | 27109 | 3.6% |
| 15 | 25635 | 3.4% |
| 56 | 8542 | 1.1% |
| 14 | 6526 | 0.9% |
| 168 | 5065 | 0.7% |
| 84 | 4739 | 0.6% |
| Other values (147) | 12174 | 1.6% |
| Value | Count | Frequency (%) |
| 0 | 2509 | 0.3% |
| 1 | 135 | < 0.1% |
| 2 | 8 | < 0.1% |
| 3 | 14 | < 0.1% |
| 5 | 16 | < 0.1% |
| 6 | 36 | < 0.1% |
| 7 | 474 | 0.1% |
| 8 | 48 | < 0.1% |
| 9 | 25 | < 0.1% |
| 10 | 72 | < 0.1% |
| Value | Count | Frequency (%) |
| 9168 | 1 | < 0.1% |
| 6028 | 1 | < 0.1% |
| 2019 | 1 | < 0.1% |
| 1801 | 1 | < 0.1% |
| 1800 | 2 | < 0.1% |
| 990 | 2 | < 0.1% |
| 980 | 1 | < 0.1% |
| 960 | 13 | < 0.1% |
| 909 | 3 | < 0.1% |
| 901 | 1 | < 0.1% |
MORNING
Real number (ℝ≥0)
| Distinct | 32 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6644295761 |
| Minimum | 0 |
| Maximum | 960 |
| Zeros | 301222 |
| Zeros (%) | 39.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 960 |
| Range | 960 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 3.376783526 |
|---|---|
| Coefficient of variation (CV) | 5.0822294 |
| Kurtosis | 34502.76255 |
| Mean | 0.6644295761 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 160.802169 |
| Sum | 503205.075 |
| Variance | 11.40266698 |
| Monotonicity | Not monotonic |
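The skewness reported here (γ1 ≈ 160.8) is what triggers the "highly skewed" alert. As an illustration of how a handful of extreme entries can dominate the third moment, here is a sketch of a moment-based skewness function. The bias-corrected form is, to my understanding, what pandas computes, but that is an assumption here, and the sample below is a made-up toy list, not the real column:

```python
import math


def skewness(values, bias_corrected=True):
    """Moment-based skewness; optionally the adjusted Fisher-Pearson form."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    g1 = m3 / m2 ** 1.5
    if not bias_corrected:
        return g1
    # Small-sample correction; negligible for hundreds of thousands of rows.
    return math.sqrt(n * (n - 1)) / (n - 2) * g1


# Made-up toy sample, NOT the real data: mostly 0/1 entries plus one
# very large value, mimicking the shape reported for this variable.
toy = [0] * 40 + [1] * 59 + [960]
print("toy sample skewness:", skewness(toy))
print("symmetric sample skewness:", skewness([1, 2, 3, 4, 5]))
```

A single outlier among a hundred small values already produces a large positive skewness, which is consistent with a maximum of 960 sitting far above a 95th percentile of 1.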
| Value | Count | Frequency (%) |
| 1 | 445006 | 58.8% |
| 0 | 301222 | 39.8% |
| 3 | 8378 | 1.1% |
| 2 | 2529 | 0.3% |
| 90 | 96 | < 0.1% |
| 180 | 49 | < 0.1% |
| 60 | 10 | < 0.1% |
| 601 | 9 | < 0.1% |
| 15 | 7 | < 0.1% |
| 120 | 5 | < 0.1% |
| Other values (22) | 38 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 301222 | 39.8% |
| 0.09 | 1 | < 0.1% |
| 0.1 | 1 | < 0.1% |
| 0.18 | 1 | < 0.1% |
| 1 | 445006 | 58.8% |
| 1.015 | 1 | < 0.1% |
| 1.03 | 3 | < 0.1% |
| 1.05 | 2 | < 0.1% |
| 1.056 | 1 | < 0.1% |
| 1.06 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 960 | 1 | < 0.1% |
| 901 | 2 | < 0.1% |
| 601 | 9 | < 0.1% |
| 301 | 1 | < 0.1% |
| 180 | 49 | < 0.1% |
| 151 | 3 | < 0.1% |
| 120 | 5 | < 0.1% |
| 90 | 96 | < 0.1% |
| 61 | 1 | < 0.1% |
| 60 | 10 | < 0.1% |
AFTERNOON
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.8 MiB |
| 0 | |
|---|---|
| 1 | 6 |
| 90 | 1 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.00000132 |
| Min length | 1 |
Characters and Unicode
| Total characters | 757350 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 757342 | > 99.9% |
| 1 | 6 | < 0.1% |
| 90 | 1 | < 0.1% |
[Figures: length histogram, pie chart]
| Value | Count | Frequency (%) |
| 0 | 757342 | > 99.9% |
| 1 | 6 | < 0.1% |
| 90 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 757343 | > 99.9% |
| 1 | 6 | < 0.1% |
| 9 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 757350 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 757343 | > 99.9% |
| 1 | 6 | < 0.1% |
| 9 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 757350 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 757343 | > 99.9% |
| 1 | 6 | < 0.1% |
| 9 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 757350 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 757343 | > 99.9% |
| 1 | 6 | < 0.1% |
| 9 | 1 | < 0.1% |
EVENING
Real number (ℝ≥0)
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.7577864366 |
| Minimum | 0 |
| Maximum | 90 |
| Zeros | 201465 |
| Zeros (%) | 26.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 90 |
| Range | 90 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.5241983383 |
|---|---|
| Coefficient of variation (CV) | 0.6917494336 |
| Kurtosis | 1511.570952 |
| Mean | 0.7577864366 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 10.58142663 |
| Sum | 573908.8 |
| Variance | 0.2747838979 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 546546 | 72.2% |
| 0 | 201465 | 26.6% |
| 3 | 8389 | 1.1% |
| 2 | 932 | 0.1% |
| 0.5 | 3 | < 0.1% |
| 15 | 3 | < 0.1% |
| 2.5 | 2 | < 0.1% |
| 4 | 2 | < 0.1% |
| 56 | 1 | < 0.1% |
| 0.3 | 1 | < 0.1% |
| Other values (5) | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 201465 | 26.6% |
| 0.3 | 1 | < 0.1% |
| 0.5 | 3 | < 0.1% |
| 1 | 546546 | 72.2% |
| 2 | 932 | 0.1% |
| 2.5 | 2 | < 0.1% |
| 3 | 8389 | 1.1% |
| 4 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| 15 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 90 | 1 | < 0.1% |
| 60 | 1 | < 0.1% |
| 56 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 26 | 1 | < 0.1% |
| 15 | 3 | < 0.1% |
| 10 | 1 | < 0.1% |
| 4 | 2 | < 0.1% |
| 3 | 8389 | 1.1% |
| 2.5 | 2 | < 0.1% |
ADR_SCREENED
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 79955 |
| Missing (%) | 10.6% |
| Memory size | 1.4 MiB |
| False | |
|---|---|
| True | 2355 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 675039 | 89.1% |
| True | 2355 | 0.3% |
| (Missing) | 79955 | 10.6% |
ADR_IDS
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | 20.5% |
| Missing | 757310 |
| Missing (%) | > 99.9% |
| Memory size | 5.8 MiB |
| 1,1,4# , , | 8 |
|---|---|
| 4,2# ,1 | 7 |
| 6#1 | 6 |
| 1#1 | 4 |
| 4#4 | 4 |
| Other values (3) | 10 |
Length
| Max length | 16 |
|---|---|
| Median length | 3 |
| Mean length | 6.076923077 |
| Min length | 2 |
Characters and Unicode
| Total characters | 237 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4,2# ,1 |
|---|---|
| 2nd row | 4,2# ,1 |
| 3rd row | 4,2# ,1 |
| 4th row | 4,2# ,1 |
| 5th row | 4,2# ,1 |
Common Values
| Value | Count | Frequency (%) |
| 1,1,4# , , | 8 | < 0.1% |
| 4,2# ,1 | 7 | < 0.1% |
| 6#1 | 6 | < 0.1% |
| 1#1 | 4 | < 0.1% |
| 4#4 | 4 | < 0.1% |
| 4,3 | 4 | < 0.1% |
| 4,3#6,4#8,4#10,3 | 3 | < 0.1% |
| 1# | 3 | < 0.1% |
| (Missing) | 757310 | > 99.9% |
Length
Pie chart
| Value | Count | Frequency (%) |
|  | 16 | 25.8% |
| 1 | 10 | 16.1% |
| 1,1,4 | 8 | 12.9% |
| 4,2 | 7 | 11.3% |
| 6#1 | 6 | 9.7% |
| 1#1 | 4 | 6.5% |
| 4,3 | 4 | 6.5% |
| 4#4 | 4 | 6.5% |
| 4,3#6,4#8,4#10,3 | 3 | 4.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| , | 62 | 26.2% |
| 1 | 43 | 18.1% |
| # | 41 | 17.3% |
| 4 | 36 | 15.2% |
| (space) | 23 | 9.7% |
| 3 | 10 | 4.2% |
| 6 | 9 | 3.8% |
| 2 | 7 | 3.0% |
| 8 | 3 | 1.3% |
| 0 | 3 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 111 | 46.8% |
| Other Punctuation | 103 | 43.5% |
| Space Separator | 23 | 9.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 43 | 38.7% |
| 4 | 36 | 32.4% |
| 3 | 10 | 9.0% |
| 6 | 9 | 8.1% |
| 2 | 7 | 6.3% |
| 8 | 3 | 2.7% |
| 0 | 3 | 2.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 62 | 60.2% |
| # | 41 | 39.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 23 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 237 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| , | 62 | 26.2% |
| 1 | 43 | 18.1% |
| # | 41 | 17.3% |
| 4 | 36 | 15.2% |
| (space) | 23 | 9.7% |
| 3 | 10 | 4.2% |
| 6 | 9 | 3.8% |
| 2 | 7 | 3.0% |
| 8 | 3 | 1.3% |
| 0 | 3 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 237 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| , | 62 | 26.2% |
| 1 | 43 | 18.1% |
| # | 41 | 17.3% |
| 4 | 36 | 15.2% |
| (space) | 23 | 9.7% |
| 3 | 10 | 4.2% |
| 6 | 9 | 3.8% |
| 2 | 7 | 3.0% |
| 8 | 3 | 1.3% |
| 0 | 3 | 1.3% |
PRESCRIP_ERROR
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.8 MiB |
| 0 | 753437 |
|---|---|
| 1 | 3912 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 757349 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 753437 | 99.5% |
| 1 | 3912 | 0.5% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 753437 | 99.5% |
| 1 | 3912 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 753437 | 99.5% |
| 1 | 3912 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 757349 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 753437 | 99.5% |
| 1 | 3912 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 757349 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 753437 | 99.5% |
| 1 | 3912 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 757349 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 753437 | 99.5% |
| 1 | 3912 | 0.5% |
ADHERENCE
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.8 MiB |
| 1 | 388667 |
|---|---|
| 0 | 368682 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 757349 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 388667 | 51.3% |
| 0 | 368682 | 48.7% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1 | 388667 | 51.3% |
| 0 | 368682 | 48.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 388667 | 51.3% |
| 0 | 368682 | 48.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 757349 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 388667 | 51.3% |
| 0 | 368682 | 48.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 757349 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 388667 | 51.3% |
| 0 | 368682 | 48.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 757349 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 388667 | 51.3% |
| 0 | 368682 | 48.7% |
DMOC_TYPE
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 578741 |
| Missing (%) | 76.4% |
| Memory size | 5.8 MiB |
| Same Facility Refill | 101317 |
|---|---|
| MMD | 66675 |
| Individual delivery/home-based | 5562 |
| MMS | 3800 |
| Different Facility Refill (Private hospital/clinic) | 1085 |
| Other values (6) | 169 |
Length
| Max length | 51 |
|---|---|
| Median length | 20 |
| Mean length | 13.78559751 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2462218 |
|---|---|
| Distinct characters | 38 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MMS |
|---|---|
| 2nd row | MMS |
| 3rd row | MMS |
| 4th row | MMS |
| 5th row | MMD |
Common Values
| Value | Count | Frequency (%) |
| Same Facility Refill | 101317 | 13.4% |
| MMD | 66675 | 8.8% |
| Individual delivery/home-based | 5562 | 0.7% |
| MMS | 3800 | 0.5% |
| Different Facility Refill (Private hospital/clinic) | 1085 | 0.1% |
| PMVs/Chemists | 56 | < 0.1% |
| Other | 36 | < 0.1% |
| CPARP | 33 | < 0.1% |
| Fixed or ad hoc pick up points | 24 | < 0.1% |
| Mobile van/other vehicle | 17 | < 0.1% |
| (Missing) | 578741 | 76.4% |
Length
| Value | Count | Frequency (%) |
| facility | 102402 | 26.2% |
| refill | 102402 | 26.2% |
| same | 101317 | 25.9% |
| mmd | 66675 | 17.0% |
| individual | 5562 | 1.4% |
| delivery/home-based | 5562 | 1.4% |
| mms | 3800 | 1.0% |
| private | 1085 | 0.3% |
| different | 1085 | 0.3% |
| hospital/clinic | 1085 | 0.3% |
| Other values (15) | 350 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 329482 | 13.4% |
| l | 320537 | 13.0% |
| e | 229412 | 9.3% |
| a | 217054 | 8.8% |
| (space) | 212717 | 8.6% |
| M | 141023 | 5.7% |
| y | 107967 | 4.4% |
| m | 106935 | 4.3% |
| t | 105790 | 4.3% |
| S | 105117 | 4.3% |
| Other values (28) | 586184 | 23.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1709287 | 69.4% |
| Uppercase Letter | 525759 | 21.4% |
| Space Separator | 212717 | 8.6% |
| Other Punctuation | 6723 | 0.3% |
| Dash Punctuation | 5562 | 0.2% |
| Open Punctuation | 1085 | < 0.1% |
| Close Punctuation | 1085 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 329482 | 19.3% |
| l | 320537 | 18.8% |
| e | 229412 | 13.4% |
| a | 217054 | 12.7% |
| y | 107967 | 6.3% |
| m | 106935 | 6.3% |
| t | 105790 | 6.2% |
| c | 104637 | 6.1% |
| f | 104572 | 6.1% |
| d | 22299 | 1.3% |
| Other values (11) | 60602 | 3.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 141023 | 26.8% |
| S | 105117 | 20.0% |
| R | 102438 | 19.5% |
| F | 102426 | 19.5% |
| D | 67760 | 12.9% |
| I | 5562 | 1.1% |
| P | 1207 | 0.2% |
| C | 95 | < 0.1% |
| V | 56 | < 0.1% |
| A | 36 | < 0.1% |
| Other values (2) | 39 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 212717 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1085 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 6723 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1085 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5562 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2235046 | 90.8% |
| Common | 227172 | 9.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 329482 | 14.7% |
| l | 320537 | 14.3% |
| e | 229412 | 10.3% |
| a | 217054 | 9.7% |
| M | 141023 | 6.3% |
| y | 107967 | 4.8% |
| m | 106935 | 4.8% |
| t | 105790 | 4.7% |
| S | 105117 | 4.7% |
| c | 104637 | 4.7% |
| Other values (23) | 467092 | 20.9% |
Common
| Value | Count | Frequency (%) |
| (space) | 212717 | 93.6% |
| / | 6723 | 3.0% |
| - | 5562 | 2.4% |
| ( | 1085 | 0.5% |
| ) | 1085 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2462218 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 329482 | 13.4% |
| l | 320537 | 13.0% |
| e | 229412 | 9.3% |
| a | 217054 | 8.8% |
| (space) | 212717 | 8.6% |
| M | 141023 | 5.7% |
| y | 107967 | 4.4% |
| m | 106935 | 4.3% |
| t | 105790 | 4.3% |
| S | 105117 | 4.3% |
| Other values (28) | 586184 | 23.8% |
BODY_WEIGHT
Numeric
| Distinct | 83 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.07154842748 |
| Minimum | 0 |
| Maximum | 85 |
| Zeros | 755118 |
| Zeros (%) | 99.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 85 |
| Range | 85 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.557751708 |
|---|---|
| Coefficient of variation (CV) | 21.77199084 |
| Kurtosis | 956.14684 |
| Mean | 0.07154842748 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 28.28863313 |
| Sum | 54187.13 |
| Variance | 2.426590382 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 755118 | 99.7% |
| 30 | 187 | < 0.1% |
| 20 | 125 | < 0.1% |
| 15 | 125 | < 0.1% |
| 10 | 113 | < 0.1% |
| 25 | 105 | < 0.1% |
| 18 | 102 | < 0.1% |
| 14 | 87 | < 0.1% |
| 13 | 77 | < 0.1% |
| 54 | 74 | < 0.1% |
| Other values (73) | 1236 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 755118 | 99.7% |
| 1.5 | 10 | < 0.1% |
| 1.6 | 26 | < 0.1% |
| 3 | 2 | < 0.1% |
| 5 | 7 | < 0.1% |
| 6 | 32 | < 0.1% |
| 6.2 | 4 | < 0.1% |
| 7 | 21 | < 0.1% |
| 8 | 43 | < 0.1% |
| 8.05 | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 85 | 4 | < 0.1% |
| 83 | 3 | < 0.1% |
| 80 | 7 | < 0.1% |
| 78 | 3 | < 0.1% |
| 75 | 4 | < 0.1% |
| 74 | 3 | < 0.1% |
| 72 | 7 | < 0.1% |
| 69 | 3 | < 0.1% |
| 68 | 3 | < 0.1% |
| 67 | 13 | < 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
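The calculation described above can be sketched in a few lines of plain Python. This is only an illustration on made-up toy values, not the code the report generator uses; the function name `pearson_r` and the sample lists are assumptions introduced here.

```python
import math

def pearson_r(xs, ys):
    """Covariance of X and Y divided by the product of their standard deviations."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length samples with at least two points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/n factors in the covariance and in the standard deviations cancel,
    # so plain sums of deviation products and squared deviations are enough.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    # Note: r is undefined (division by zero) if either sample is constant.
    return cov / math.sqrt(ss_x * ss_y)

# Toy data, not taken from the profiled dataset.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 1.0, 4.0, 3.0, 5.0]
r = pearson_r(x, y)

# Shifting and rescaling x leaves r unchanged (location/scale invariance).
r_rescaled = pearson_r([10.0 * v + 3.0 for v in x], y)
```

For these toy lists the deviation products sum to 8 and both sums of squares are 10, so `r` comes out as 0.8, and the rescaled input gives the same value up to floating-point rounding.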
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at capturing nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
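As a rough illustration of the rank-based calculation, the sketch below ranks each variable (tied values share the average of their positions, which is one common convention and an assumption here) and then applies the Pearson formula to the ranks. The helper names `average_ranks`, `pearson_r` and `spearman_rho` and the toy data are hypothetical.

```python
import math

def average_ranks(values):
    """Rank values from 1..n; tied values receive the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(xs, ys):
    """Covariance divided by the product of standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(ss_x * ss_y)

def spearman_rho(xs, ys):
    """Pearson correlation of the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))

# Toy data: y = x**3 is monotonic in x but not linear.
x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]
rho = spearman_rho(x, y)   # the ranks coincide, so rho is 1
r = pearson_r(x, y)        # strictly below 1 because the relation is not linear
```

The example shows the point made in the text: a perfectly monotonic but nonlinear relationship gets ρ = 1 while Pearson's r stays below 1.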
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is then the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
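The pair-counting definition above translates directly into a short sketch. It implements the formula exactly as stated (often called τ-a), with no correction for ties; tie-adjusted variants such as τ-b exist and may be what a given library computes. The function name `kendall_tau_a` and the toy data are assumptions made for illustration.

```python
from itertools import combinations

def kendall_tau_a(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length samples with at least two points")
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1   # both variables move in the same direction
        elif sign < 0:
            discordant += 1   # the variables move in opposite directions
        # Pairs tied in x or y (sign == 0) count as neither.
    n = len(xs)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

# Toy data: one swapped pair out of six.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
```

With four observations there are six pairs; five are concordant and one is discordant, so `tau` is (5 - 1) / 6, or about 0.667.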
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
First rows
| State | L.G.A | Facility Name | Regimen Line | Regimen | PHARMACY_ID | PATIENT_ID | FACILITY_ID | DATE_VISIT | DURATION | MORNING | AFTERNOON | EVENING | ADR_SCREENED | ADR_IDS | PRESCRIP_ERROR | ADHERENCE | NEXT_APPOINTMENT | DMOC_TYPE | BODY_WEIGHT | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Adamawa | Girei | Girei B Clinic | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 209355 | 37992 | 421 | 2019-11-15 00:00:00 | 30 | 0.0 | 0 | 1.0 | No | NaN | 0 | 0 | 2019-12-10 00:00:00 | NaN | 0.0 |
| 1 | Adamawa | Girei | Girei B Clinic | Isoniazid Preventive Therapy (IPT) | Isoniazid 300mg | 209362 | 37987 | 421 | 2020-02-21 00:00:00 | 56 | 1.0 | 0 | 0.0 | No | NaN | 0 | 0 | 2020-04-15 00:00:00 | NaN | 0.0 |
| 2 | Adamawa | Girei | Girei B Clinic | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 209368 | 38101 | 421 | 2019-11-18 00:00:00 | 60 | 0.0 | 0 | 1.0 | No | NaN | 0 | 0 | 2020-01-13 00:00:00 | NaN | 0.0 |
| 3 | Adamawa | Girei | Girei B Clinic | ART First Line Adult | TDF(300mg)+3TC(300mg)+EFV(600mg) | 209375 | 37937 | 421 | 2018-10-02 00:00:00 | 60 | 0.0 | 0 | 1.0 | NaN | NaN | 0 | 0 | 2018-11-02 00:00:00 | NaN | 0.0 |
| 4 | Adamawa | Girei | Girei B Clinic | ART First Line Adult | TDF(300mg)+3TC(300mg)+EFV(600mg) | 209382 | 37870 | 421 | 2019-06-03 00:00:00 | 60 | 0.0 | 0 | 1.0 | No | NaN | 0 | 0 | 2019-08-02 00:00:00 | NaN | 0.0 |
| 5 | Adamawa | Girei | Girei B Clinic | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 209389 | 38094 | 421 | 2020-02-27 00:00:00 | 30 | 0.0 | 0 | 1.0 | No | NaN | 0 | 0 | 2020-03-27 00:00:00 | NaN | 0.0 |
| 6 | Adamawa | Girei | Girei B Clinic | Cotrimoxazole (CTX) Prophylaxis | Cotrimoxazole 960mg | 209396 | 37976 | 421 | 2019-01-24 00:00:00 | 30 | 1.0 | 0 | 0.0 | No | NaN | 0 | 1 | 2019-02-21 00:00:00 | NaN | 0.0 |
| 7 | Adamawa | Girei | Girei B Clinic | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 209402 | 38024 | 421 | 2020-03-23 00:00:00 | 90 | 0.0 | 0 | 1.0 | No | NaN | 0 | 0 | 2020-06-19 00:00:00 | NaN | 0.0 |
| 8 | Adamawa | Girei | Girei B Clinic | ART First Line Adult | TDF(300mg)+3TC(300mg)+EFV(600mg) | 209407 | 37870 | 421 | 2017-06-26 00:00:00 | 60 | 0.0 | 0 | 1.0 | NaN | NaN | 0 | 0 | 2017-08-27 00:00:00 | NaN | 0.0 |
| 9 | Adamawa | Girei | Girei B Clinic | ART First Line Adult | TDF(300mg)+3TC(300mg)+EFV(600mg) | 209415 | 38031 | 421 | 2019-02-22 00:00:00 | 30 | 1.0 | 0 | 1.0 | NaN | NaN | 0 | 0 | 2019-03-21 00:00:00 | NaN | 0.0 |
Last rows
| State | L.G.A | Facility Name | Regimen Line | Regimen | PHARMACY_ID | PATIENT_ID | FACILITY_ID | DATE_VISIT | DURATION | MORNING | AFTERNOON | EVENING | ADR_SCREENED | ADR_IDS | PRESCRIP_ERROR | ADHERENCE | NEXT_APPOINTMENT | DMOC_TYPE | BODY_WEIGHT | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 757339 | Adamawa | Mubi South | Mubi General Hospital | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 4080343 | 51773 | 434 | 2021-05-28 00:00:00 | 180 | 0.0 | 0 | 1.0 | No | NaN | 0 | 0 | 2021-11-10 00:00:00 | Same Facility Refill | 0.0 |
| 757340 | Adamawa | Mubi South | Mubi General Hospital | ART Second Line Adult | TDF(300mg)+3TC(150mg)+ATV/r(300/100mg) | 4080344 | 51530 | 434 | 2021-05-28 00:00:00 | 90 | 1.0 | 0 | 1.0 | No | NaN | 0 | 0 | 2021-08-20 00:00:00 | Same Facility Refill | 0.0 |
| 757341 | Adamawa | Mubi South | Mubi General Hospital | Cotrimoxazole (CTX) Prophylaxis | Cotrimoxazole 960mg | 4080345 | 48506 | 434 | 2021-04-16 00:00:00 | 90 | 1.0 | 0 | 0.0 | No | NaN | 0 | 0 | 2021-10-22 00:00:00 | Same Facility Refill | 0.0 |
| 757342 | Adamawa | Mubi South | Mubi General Hospital | Cotrimoxazole (CTX) Prophylaxis | Cotrimoxazole 960mg | 4080346 | 159222 | 434 | 2021-04-28 00:00:00 | 90 | 1.0 | 0 | 0.0 | No | NaN | 0 | 0 | 2021-07-21 00:00:00 | Same Facility Refill | 0.0 |
| 757343 | Adamawa | Mubi South | Mubi General Hospital | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 4080347 | 45527 | 434 | 2021-05-27 00:00:00 | 90 | 0.0 | 0 | 1.0 | No | NaN | 0 | 0 | 2021-07-21 00:00:00 | Same Facility Refill | 0.0 |
| 757344 | Adamawa | Mubi South | Mubi General Hospital | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 4080349 | 50854 | 434 | 2021-05-28 00:00:00 | 90 | 0.0 | 0 | 1.0 | No | NaN | 0 | 0 | 2021-08-20 00:00:00 | Same Facility Refill | 0.0 |
| 757345 | Adamawa | Mubi South | Mubi General Hospital | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 4080350 | 45080 | 434 | 2021-05-28 00:00:00 | 180 | 0.0 | 0 | 1.0 | No | NaN | 0 | 0 | 2021-11-12 00:00:00 | Same Facility Refill | 0.0 |
| 757346 | Adamawa | Mubi South | Mubi General Hospital | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 4080351 | 47811 | 434 | 2021-05-28 00:00:00 | 180 | 0.0 | 0 | 1.0 | No | NaN | 0 | 0 | 2021-11-12 00:00:00 | Same Facility Refill | 0.0 |
| 757347 | Adamawa | Mubi South | Mubi General Hospital | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 4080352 | 44969 | 434 | 2021-05-28 00:00:00 | 180 | 0.0 | 0 | 1.0 | No | NaN | 0 | 0 | 2021-11-12 00:00:00 | Same Facility Refill | 0.0 |
| 757348 | Adamawa | Mubi South | Mubi General Hospital | Isoniazid Preventive Therapy (IPT) | Isoniazid 300mg | 4080353 | 159222 | 434 | 2021-04-28 00:00:00 | 84 | 1.0 | 0 | 0.0 | No | NaN | 0 | 0 | 2021-07-21 00:00:00 | Same Facility Refill | 0.0 |